THE STRUCTURE THEORY OF SET ADDITION REVISITED 



TOM SANDERS 

Abstract. In this article we survey some of the recent developments in the structure 
theory of set addition. 



1. Introduction 

The purpose of this survey is to review some recent advances in Freiman's theorem, one 
of the central results in what is called the structure theory of set addition. This theory 
was first systematically developed by Freiman in Fre66l IFre73aj and a large part of it is 
concerned with the question 'what do approximate groups look like?' 

In fact we shall be interested in what Abelian approximate cosets of sub-groups look 
like. To craft a more concrete question it is useful to have some notation: suppose, as it 
shall be throughout, that G is an Abelian group. Given A,A'<^G we write A + A' for the 
sumset of A and A' which is defined by 

A + A' := {a + a' : a e A, a' e A'}. 

Given this it is easy to check that a subset W of G is a coset (of a subgroup) if and only if 

W *0 and |W + W| = \W\. 

As indicated we are interested in approximate cosets and to this end we relax these re- 
quirements so that they are only approximately true. Relaxing the first requirement does 
not lead to an interesting generalisation; for the second we ask that the sumset be 'not 
much larger' than the original set. To be clear given K > 1 we say that A (non-empty) 
has doubling^ K if \A + A\ < K\A\ and are interested in which sets have this property. 

We shall be interested in the case when the doubling is small and to get a sense of what 
this means it is worth noting that trivially every set has doubling \A\ since there cannot be 
more elements in A + A than there are pairs in A 2 . (In fact this can trivially be improved 
to (\A\ — l)/2 but our interest at this stage is really in orders of magnitude.) 

It may be instructive on a first read to think of K = 0(1) as \A\ —*■ oo, although it will 
turn out later that we can allow K to grow (slowly) with \A\. 

If A is a coset then A has doubling 1 which is certainly small, but are there any other 
sets with small doubling? One way to create such sets is to take a large subset of a coset. 
In particular, suppose that W is a coset in G and A a W is such that \W\ < A|A|. Then 
since A + A <^ W + W we conclude that A has doubling K. 



-^One might very reasonably suggest that one use the phrase 'doubling ratio' instead of 'doubling' here. 
While this would be sensible, this is not the terminology in use in the subject and to maintain consistency 
with existing literature we shall follow the standard terminology. 
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It turns out that if K is small enough then the above construction is characteristic in the 
sense that it is the only way to create sets with doubling K. This was proved by Freiman 
in [Fre73bj and appears as |TV06t Exercise 2.6.5]. At this point it is worth remarking that 
the book |TV06j of Tao and Vu is the standard text for many of the better known aspects 
of the material we shall be discussing and, where possible, we have given references to that 
alongside the original source. 

Proposition 1.1. Suppose that \A + A\ < K\A\ for some K < 1.5. Then there is a 
subgroup H of size at most K\A\ such that A is contained in a coset of H . 

(For the unfamiliar it may be worth saying that this result is not the Freiman's theorem 
we shall ultimately be interested in.) 

The result shows that the only way of creating sets with doubling less than 1.5 is the 
method described before the proposition and, moreover, every set created in that way has 
doubling less than 1.5: the result characterises sets with doubling less than 1.5. 

There is a good reason for the limitation of 1.5 above and that is because there is a 
qualitatively new way of constructing sets with small doubling. Suppose that H ^ G, 
x + H g G/H has order 4 and put A := H u (x + H). Then a short calculation shows that 
| A + A\ = 1.5|v4| but any coset containing A has size at least 2\A\. 

Instead of taking large subsets of one coset our new construction takes unions of cosets 
(of the same subgroup). In light of this we introduce a new piece of terminolog}0: we say 
that a set A is /c-covered by B if there is a set X of size at most k such that Acl + B. 

One can combine the two ways of creating sets with small doubling by considering 
(disjoint) unions of large subsets of cosets (of the same subgroup) to produce more sets 
with small doubling, and it turns out that while the doubling remains less than 2 this is 
the only way of creating sets with doubling less than 2. 

Proposition 1.2. Suppose that \A + A\ ^ K\A\ for some K < 2 — e. Then there is a 
subgroup H of G such that \H\ < K\A\ and A is O e (l) -covered by H. 

Unlike Proposition 11.11 this result is not characteristic in that not every set satisfying 
the conclusion has doubling strictly less than 2. Indeed, the doubling of such sets may be 
much larger than 2. In fact a more precise characterisation of sets with doubling in this 
range is available in the form of Kneser's theorem ( |Kne53j or |TV06[ Theorem 5.5]) from 
which Proposition 11.21 follows via something called a covering argument of a sort we shall 
see later in §H 

The example to highlight the limitation of Proposition 11.11 was the first in a series of 
examples generated by longer and longer arithmetic progressions and these examples go 
some way to explaining why 2 should be a critical point in Proposition 11.21 Indeed, if 
G = Z and A is a finite arithmetic progression then A has doubling (2 — 1/|A|). If some 
version of the conclusion of Proposition II .21 were to hold without the dependence on e then 
we should need to cover A by 0(1) cosets of a subgroup H < G of size 0(|A|). Of course 
the only finite subgroup of Z is {0} and so this is not possible. 



2 This follows Green and Ruzsa |GR06j . and has been much popularised by Tao |Tao08j . 
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This last example shows us that if we are to have a hope of describing sets with doubling 
2 then we shall need to admit another form of structure: long arithmetic progressions. An 
arithmetic progression can be thought of as a discrete representation of an interval and in 
this light can be seen as a special case of a more general structure which, it turns out, also 
has small doubling: lattices in convex bodies. 

A centred convex progression is a set P in G, a symmetric convex body Q in M d and 
a homomorphism <fi : 7L d —*■ G such that 0(Z d n Q) = P. We say that P is d-dimensional 
and we shall usually simply talk about the set P with Q and (J) being implied (despite the 
fact that they are not necessarily well-defined). 

Given this definition a (symmetric) arithmetic progression is a 1-dimensional centred 
convex progression and all 1-dimensional centred convex progressions are (symmetric) 
arithmetic progressions. 

A convex body in M. d has doubling 2 d , and it turns out that this doubling property 
is inherited by (i-dimensional convex progressions in the sense that they have doubling 
exp(0(d)). The proof of this is not very difficult and can be done using a covering argument. 
The details are in Lemma 14.21 to avoid breaking the flow. 

Given a set of small doubling we can always create a new set with small doubling by 
adding a subgroup. In light of this we define a rf-dimensional centred convex coset 
progression to be a set of the form P + H where P is a (/-dimensional centred convex 
progression and if is a subgroup of G; this also has doubling exp(0(d)). (Again, see 
Lemma 14.21 for a proof. ) 

With this new type of structure we can set about constructing a large class of sets with 
small doubling (small here meaning 0(1)). In our earlier discussion we found two methods 
of producing sets with small doubling from subgroups: we could take large subsets and 
we could take a union of a small number of cosets. We now replace 'subgroup' in these 
constructions by 'centred coset progression'. 

Suppose that A is exp(<i)-covered by a (i-dimensional centred convex coset progression 
M of size at most exp(d)|A|. Then by definition there is a set X of size at most exp(cf) 
such that A a X + M whence 

(1.1) \A + A\ ^ \X + M + X + M\ ^ \X\ 2 \M + M\= exp(0(d))\A\, 

so that A has doubling exp(0(d)). Remarkably it turns out that the above is the only way 
of constructing sets of small doubling. 

Theorem 1.3 (Green- Ruzsa theorem; Freiman's theorem for Abelian groups). Suppose 
that \A + A\ < -K"|-<4|- Then A is exp(d(K))-covered by a d(K) -dimensional centred convex 
coset progression M of size at most exp(d(K))\A\. 

The result above was first proved by Freiman |Fre66j for the case of G torsion-free and 
later a new proof with better bounds was given (for the same setting) by Ruzsa in |Ruz94j . 
In |Ruz99] Ruzsa proved the result for groups of bounded exponent which is in some sense 
at the other end of the spectrum from torsion-free, and then Green and Ruzsa in |GR07j 
established the result above for arbitrary (Abelian) groups with another proof appearing 
a little later in [TV061 Theorem 5.43]. 
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While Theorem 11.31 resolves the qualitative question of the structure of sets with small 
doubling, the quantitative question remains and this is where most of the recent advances 
have been. In their first proof of Theorem 11.31 Green and Ruzsa showed that one may take 

d(K) = 0(K A+o{l) ). 

Various strengthenings were available at that time for torsion-free and groups of bounded 
exponent. (See, for example, |Cha02] or the appendix to |Bou08j for the torsion-free case, 
and [GT09b| for the bounded exponent case.) Unfortunately, all bounds were of the form 
d(K) = 0(K C ) for some C > 0, and it was seen as a significant open problem to show 
d(K) = Q(K °W). 

In [Schll] Schoen made a striking breakthrough proving a bound of the forn^ 

d(K) = 0(exp(0(v / hgK))), 

and then shortly after that Croot and Sisask came out with an important new argument 
in [CS10] which it turned out could be used to prove 

d{K) = O(log 3+O(1) K). 

Establishing this is one of the main goals of this survey; to be clear we shall prove the 
following version of Theorem 11.31 

Theorem 1.4 (Green-Ruzsa theorem, good bounds). Suppose that \A + A\ < Then 
A is exp(0(log 3+ °^ K))-covered by an 0(log 3+ ° ( - 1 - ) K)- dimensional centred convex coset 
progression M of size at most exp(0(log 3+0< - 1 ^ i<T)) 

This result with a power of 6 instead of 3 was shown in |Sanl0j . and in the basic 
framework of this paper that 6 improves to a 4. An improvement of the 4 to a 3 is the 
result of a wonderful iterative application of our basic tool which is due to Konyagin. 

For comparison the calculation in the construction before Theorem 11.31 turns out to be 
tight and it follows from this that d(K) = f2(logi^), and this is conjecturally the correct 
order of magnitude. To see this suppose that \A + A\ = K\A\ and note by the calculation 
in dUI]) that 

K\A\ ^exp(2d(K))exp(0(d(K)))exp(d(K))\A\ = exp(0(d(K)))\A\, 
from which the lower bound on d(K) follows. 

Conjecture 1.5 (Polynomial Freiman-Ruzsa conjecture). Suppose that A has \A + A\ < 
Then A is exp(0(log K)) -covered by an O '(log K)- dimensional centred convex coset 
progression M of size at most exp(0 (log K))\A\ . 

We have skipped over a number of the details in this introduction, but before moving 
on to a more careful discussion it is worth making a couple of remarks on why Freiman's 
theorem is important. 

3 This is our first use of logs in this survey, and they will appear a lot more. We shall always think of 
the argument as being larger than some constant, but if the reader does not wish to concern themselves 
with this then they may think of logo; as denoting log(2 + x). 



THE STRUCTURE THEORY OF SET ADDITION REVISITED 



5 



First there is a practical reason: as a result of the celebrated work of Gowers |Gow98[ 
IGowOlj in the late 90s Freiman's theorem has found a bevy of applications. For exam- 
ple, Gowers himself used it to spectacularly improve the bounds in Szemeredi's theorem; 
Szemeredi and Vu used it to investigate long arithmetic progressions in jSV06j ; Tao and 
Vu used it to investigate random matrices in |TV07] ; Schoen records many shorter con- 
sequences at the end of his paper |Schllj on Freiman's theorem; and Chang in |Cha09] 
collects together a number of other applications where good bounds would be particularly 
useful. There is some discussion of applications at the end of the paper in £ fl"3l 

Secondly there are good theoretical reasons, three of which we shall record now. They 
may not all make precise sense at this point in the article, but part of our hope is that we 
shall be able to go some way towards explaining them. 

(i) The hypothesis of the theorem is easily satisfied. In a sense we have seen that 
this is true empirically as a result of the many applications. From a theoretical 
perspective this is because convex coset progressions are ubiquitous in contrast to 
subgroups (in some groups). An example to bear in mind is G = 7LjpL for p a 
prime. This has a very poor subgroup structure, but since arithmetic progressions 
are convex coset progressions we see immediately that there is an abundance of 
convex progressions. 

(ii) A convex coset progression supports a lot of structure. While it is not a coset, 
it behaves enough like a coset that it can support many commonly used analytic 
arguments, and in particular a sort of approximate harmonic analysis. This means 
that many results for groups can also be established for convex coset progressions. 
The pioneering work here is that of Bourgain [Bou99j which was framed in a level 
of generality which includes convex coset progressions by Green and the author in 
[GS08] . 

(iii) Finally, the result is a rough equivalence: any set satisfying the conclusion of 
the theorem satisfies the hypothesis with K replaced by exp(0(d(K))). Thus the 
better the bound on the function d(K) the less loss there is in passing from the 
implicit algebraic data that a set has small doubling to the explicit algebraic data 
that it is generated from a convex coset progression. 

The paper now splits as follows. In the next section, §2], we describe the main plan of attack 
on Freiman's theorem which roughly splits it into two parts. The first part is covered in 
§SHH the second in §^9TTTT1 There is a concluding section in £1T2| and also a section on 
Pliinnecke's inequality in §3] which is a basic tool in the structure theory of set addition 
and has recently received a fantastic new proof by Petridis. 

2. Overview 

The proof of Theorem 11.41 splits naturally into two parts: one covers the more combi- 
natorial aspects, and one the more harmonic analytic aspects. This particular de-coupling 
can be said to originate with the work of Green and Ruzsa |GR07] . although their focus 
was much more on the second of the two, while the more recent improvements to the 
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bounds have arisen (largely) from more careful combinatorial analysis in the first part of 
the argument. 

The key definition is that of relative polynomial growth: to be clear we say that a set 
X has relative polynomial growth of order d if 

\nX\ < n d \X\ for all n e N. 

One might reasonably wish to insert a constant in front of the term on the right hand side, 
but we shall find that we are easily able to absorb this into the dimension at little cost to 
the quality of our eventual bounds. 

It is worth noting that having relative polynomial growth is a priori stronger than a 
small doubling condition. It will turn out later (see Proposition 15. ip that the conditions 
are qualitatively equivalent in that doubling K implies relative polynomial growth of order 
Ox(l), but quantitatively this equivalence entails an exponential loss and is the reason for 
the exponential weakness of the original arguments of Green and Ruzsa. 

With the definition above the argument splits into the following two parts. 

(i) (From small doubling to relative polynomial growth) Given a set A with \A + A\ < 
K\A\ we find a symmetric neighbourhood of the identity, X, (meaning that 
X = —X and 0g e X) of size at most 0^(|A|) with relative polynomial growth of 
order Ok {I) such that A is 0^(l)-covered by X. 

(ii) (From relative polynomial growth to convex coset progressions) Given a symmetric 
neighbourhood of the identity, X, with relative polynomial growth of order d we 
show that X is contained in an (^(^-dimensional centred convex coset progression 
of size at most 0^(1^1). 

Note that if we had proved these two statements then they combine to give Theorem II .31 
We now turn to look at these two parts in a little more detail. 

2.1. From small doubling to relative polynomial growth. The starting point here 
are the covering arguments of Ruzsa which will be developed in §U and which will be 
related to relative polynomial growth in §5J As we shall see there it is possible to use these 
covering arguments to show that if \A + A\ < K\A\ then A has relative polynomial growth 
of order 0(K A ) and from there it is a short step to the following corollary. 

Corollary 2.2. Suppose that \A + A\ < ^|^.|- Then A is 1-covered by a symmetric 
neighbourhood of the identity of size at most exp(0 (log K))\A\ and relative polynomial 
growth of order 0(K 4 ). 

This result is much weaker than we should like, but it turns out that it is essentially so 
because it provides a set which 1-covers. In £ fT2l we discuss an example of a set A with 
doubling K such that any set 1-covering it must have either relative polynomial growth 
of order Q(K) or size exp(f2(iT))|v4|. Thus to improve the bound on the order of relative 
polynomial growth we shall need to increase the covering number. 

In §6] we discuss a general framework for improving the above Corollary 12.21 before £J7] 
where we introduce a key new tool: the Croot-Sisask lemma. $7] includes the following 
result which can be seen as representing the state of the art prior to Schoen |Schllj and 
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Croot and Sisask |CS10] (although we shall use a special case of the Croot-Sisask lemma 
to prove it). 

Proposition 2.3. Suppose that \A + A\ < K\A\. Then A is exp(0(ii~ 1+o(1) )) -covered by 
a symmetric neighbourhood of the identity of size at most exp(0 (log K))\A\ and relative 
polynomial growth of order 0(K 1+o ^) . 

In SJB] we shall then make much more effective use of the Croot-Sisask lemma to show 
the following. 

Proposition 2.4. Suppose that \A + A\ ^ ^|^4|- Then A is exp (O (log 4 K))- covered by 
a symmetric neighbourhood of the identity of size at most exp(0 (log K))\A\ and relative 
polynomial growth of order 0(log 4 K). 

This result is where most of the more recent new material appears, but there is then 
also a combinatorial refinement following Konyagin which leads to our strongest result at 
the end of gS 

Proposition 2.5. Suppose that \A + A\ ^ K\A\. Then A is exp(0(log 3+o(1) K))-covered 
by a symmetric neighbourhood of the identity of size at most exp (0(log 1+o(1) K))\A\ and 
relative polynomial growth of order 0(log 3+ °^ K). 

2.6. From relative polynomial growth to convex coset progressions. To pass from 
relative polynomial growth to convex coset progressions it is useful to start by considering 
some examples of sets with relative polynomial growth. Of course, if Q is a convex body in 
M. d then fi(nQ) = n d fi(Q) for all n ^ 1 and so one expects that any (/-dimensional centred 
convex coset progression has relative polynomial growth roughly d (in fact d 1+0 ^'). 

Now, if P is a d- dimensional centred convex coset progression and X a P has size 
exp(-d 1+ °W)|P| then 

\nX\ ^ \nP\ < n \P\ = 0(n) \X\ = n \X\ for all n ^ 1. 

Crucially, though, a union of exp(rf 1+ °^^) translates of centred convex coset progressions 
will (generically) have relative polynomial growth of order exp(d 1+0 ( 1 ^) and not d 1+ °^ so 
that relative polynomial growth distinguishes between covering and containment in a way 
that doubling does not. 

It turns out that there is a matching result which tells us that essentially the only way 
of creating sets of relative polynomial growth is by the above method. 

Theorem 2.7. Suppose that X has relative polynomial growth of order d. Then there is a 
centred convex coset progression M such that 

X -X c M,dimM = 0(dlog 2 d) and \M\ ^ exp(0(dlog 2 d))\X\. 

The first thing to say is that the dimension here is tight up to factors of log d. This can 
be seen by, for example, letting X be the cube of side length iV in 7L d . This has polynomial 
growth of order Q(d) and any convex coset progression containing X has tripling at least 
2 d by the discrete Brunn-Minkowski inequality (see, e.g. |GT06t Lemma 2.4]) and so has 
dimension Q(d). 
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This result is the part of the argument which uses harmonic analysis and itself splits 
into a number of parts. These are covered in the second part of the paper starting at 

We should remark that Theorem II .41 follows immediately from Proposition 12.51 and The- 
orem [2T71 



3. Plunnecke's inequality 

This section is the final section before we plunge into the proof of Freiman's theorem 
and it will cover the invaluable tool of Plunnecke's inequality following the exciting new 
work by Petridis [Petllbl IPetllaj . The discussion in his papers is more comprehensive 
than ours and we direct the reader interested in more details there, but we hope to cover 
the salient features in what follows. 

Our starting point is the observation that given Freiman's theorem if \A + A\ ^ K\A\ 
then there is an Ox(l)-dimensional centred convex coset progression M of size Ox(l^-I) 
such that A is 0^(l)-covered by M. This means that there is some set X of size 0^(1) 
such that A a X + M. On the other hand as remarked in §2.61 the set M has relative 
polynomial growth of order 0^(1), hence 

\nA\ < \nX\\nM\ ^ \X\ n (n)° K{l) \M\ = K (l) n \A\ for all n e N. 

It turns out that a much stronger inequality is true: 

Theorem 3.1 (Plunnecke's inequality). Suppose that \A + A\ ^ ^|^4|- Then 

\nA\ ^K n \A\ for allneN. 

This result is due to Plunnecke's [Pfii69j and was rediscovered and greatly developed by 
Ruzsa |Ruz89j . Both Ruzsa and Plunnecke's arguments were graph theoretic and quite 
involved appealing to Menger's theorem (see |TV06t §6.5] for details). In |Petllb] Petridis 
removed the need for Menger's theorem and then a little later in [Pet 11a] he found a 
wonderful entirely new proof. 

The core of Petridis' argument is the next lemma. The idea is that if we are given sets A 
and X such that + ^ K\X\ then it is a good idea to pass to the 'best' possible subset 
of X. That is to say, to pass to the subset X' of X for which \A + X'\/\X'\ is minimal. 
Turning this around if X is already the 'best' subset then every X' a X has \A + X'\/\X'\ 
bigger than \A + In this case Petridis proved the following beautiful lemma. 

Lemma 3.2. Suppose that \A + X\ ^ K\X\ and \A + X'\ ^ K\X'\ for all X' c X. Then 
for all (finite) sets C we have 

\A + X + C\ ^ K\X + C\. 
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Proof. We iteratively decompose X + C into disjoint sets contained in translates of X: 
X + C = u c X c where X c c X + c. Writing Y c := (X + c)\X c we have 

\a+x + c\ ^j]\ A+x c\ = X!i^+(( x + c )vni 

c c 

^ 2(IA + x + cl-l^ + yl) 

c 

< 2(^ + c|-AT|y c |) = 2*W = K\X + C\. 

c c 

The result is proved. □ 

Given the idea of passing to this 'best' possible X the proof is rather natural, but the 
reader should make no mistake: the idea to do this is very nice and eluded many people! 
Petridis then gives the following immediate corollary. 

Corollary 3.3. Suppose that \A + B\ < K\B\. Then there is some non-empty X a B such 
that 

\nA + X\^ K n \X\ for all nsN. 

Proof. We can pick X a B such that is minimal over (non-empty) subsets of B. 

In this case A and X satisfy the hypotheses of Petridis' lemma and hence the conclusion. 
Applying the conclusion with C = (n — I) A we get that 

\X + nA\ ^ K\X + (n - 1)A\ for all jieN, 

and this gives the result (by induction). □ 

Note that Pliinnecke's inequality (Theorem 13. ip follows immediately from this applied 
to the set A and B = A since X a B = A. 

It is also possible to control jointly positive and negative sums of A using the following 
result called Ruzsa's triangle inequality |Ruz78] (see also |TV06[ Lemma 2.6]). 

Lemma 3.4 (Ruzsa's triangle inequality). Suppose that \A — B\ ^ K\B\ and \B — C\ ^ 
L\B\. Then 

\A + C\ < KL\B\. 

Proof. We consider the map B x (A + C) —> (A — B) x (B — C) defined by (6, s) >-> 
(a(s) — b, b — c(s)) where a(s) and c(s) are functions on A + C such that a(s) e A, c(s) e C 
and a(s) + c(s) = s. It is easy to check that our map on B x (A + C) is an injection: 
suppose that 

(a(s) ~b,b~ c(s)) = (a(s') - b' , b' - c(s')), 

then adding we get that s = a(s) + c(s) = a(s') + c(s') = s' and so s = s', and hence b = b'. 
It follows from this that \B\\A + C\ ^ \A — B\\B — C\ and we have the result. □ 

It may be intuitively helpful to know that this can be seen as the triangle inequality for 
a certain pseudo-metric one can define on sets (in groups) called the Ruzsa distance. (See 
[TV061 §2.3] for more details.) 
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As an immediate corollary of our work so far we have the so-called Pliinnecke-Ruzsa 
inequalities which are slightly more general than Pliinnecke's inequality. 

Corollary 3.5 (Pliinnecke-Ruzsa inequalities). Suppose that \A + A\ < A' | y4 1 . Then 

\nA - mA\ < K n+m \A\ for all n, m e N. 

Proof. Apply Corollary 13.31 to get a set X c A such that \rA + X\ < iT r |X| for all r e N so 
that in particular \nA + X\ ^ J\" n |X| and | — mi — X| ^ i^ m |X|. It follows from Ruzsa's 
triangle inequality that \nA — mA\ < K n+m \X\ < K" ,+m |yl| as required. □ 

4. Ruzsa's covering lemma 

Pliinnecke's inequality showed us how small doubling leads to small higher order sums. In 
[Ruz99j Ruzsa introduced another argument called a covering argument to the area which 
yields quantitatively similar order results to Pliinnecke's inequality but has the advantage 
of also providing a little structure. This covering argument is the topic of this section and 
will already give us a version of Freiman's theorem in groups of bounded exponent. We 
start with the basic lemma: 

Lemma 4.1 (Ruzsa's covering lemma, [TV061 Lemma 2.14]). Suppose that \A + S\ ^ K\S\. 
Then there is a set T a A with \T\ < K such that A a T + S — S . 

Proof. The technique here is very powerful so it is worth developing in some detail: we 
let T c A be maximal ^-separated. (The set T is S-separated if every pair of distinct 
elements t,t' e T have t + S and t' + S disjoint.) It follows that \T + S\ = \T\\S\. On the 
other hand, since T c A, we have T + S c A + S and so 

\T\\S\ = \T + S\ < \A + S\ < K\S\; 

we conclude that \T\ < K. 

Now we use the fact that T is maximal: if a e A then (either trivially if a e T or) 
by maximality there is some t e T such that (t + S) n (a + S) ^ 0- It follows that 
ast + S — S<^T + S — S and the result is proved. □ 

It should be remarked that this has an extension developed by Tao in [Tao08j giving 
a non-Abelian version of a (slightly weak) Pliinnecke inequality, although now Petridis' 
approach to Pliinnecke's inequality also yields a non-Abelian version of the (almost) full 
strength Pliinnecke inequality. 

Lemma 14.11 (or rather the technique used to prove it) can also be used to show that 
d- dimensional centred convex progressions have doubling exp(0(d)). 

Lemma 4.2. Suppose that M is a d- dimensional centred convex coset progression. Then 
\M + M\ < exp(0(d))\M\. 

Proof. To start with we write M = P + H for some centred convex progression P and 
Q a M. d for the convex body generating P. Given A e M >0 we write XQ for the set Q 
dilated by a factor A so that fi(XQ) = \ d fi(Q). 
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Now, let X a 2Q be a maximal iQ-separated set so that by the same argument as in 
Ruzsa's covering lemma we have 

2Q c X + l -Q - = X + l -Q and |X|4- d /i(Q) < (9/4) VQ). 
From the second of these it follows that |X| < 9 d . With the first we note that 

p + p = (j)(Q n z d ) + 0(Q n Z d ) c 0(2Q n Z d ) c (J cf)((x + -Q) n Z d ). 

xeX ^ 

Let T be a set such that if + \Q) n Z d ) ^ for some x s X, then T contains exactly 
one element of this set, so that |T| < \X\. Then if t' e </>((x + ~Q) n Z rf ) we have some 
i e T such that f-t'e 0(Q n Z d ) = P, whence PlPcTlP. Adding if we get that 
(P + H) + (P + H) a T + (P + #) since H + H = H, and the result follows given the 
bound on |X| (and hence \T\). □ 

One informative illustration of why Ruzsa's covering lemma is so powerful is given in 
Ruzsa's original paper |Ruz99] . 

Proposition 4.3 (Freiman-Ruzsa theorem for groups of bounded exponent). Suppose that 
G has exponent r and \A + A\ < Then (A), the group generated by A, has size at 

most K 2 r Ri \A\. 

Proof. The idea is simply to apply Ruzsa's covering lemma to 2A — A. By the Pliinnecke- 
Ruzsa inequalities we have that \(2A — A) + A\ = \3A — A\ < K 4 |A| and so there is a set 
T of size at most K 4 such that 

A+(A-A) = 2A-A^T + A-A. 

By induction it follows that nA + (A — A) c nT + (A — A) for all n e N. We write H for 
the group generated by T and note that \H\ ^ r' T ' and nT + A — A c H + A — A. We 
conclude that nA a nA + (A — A) a H + A — A for all n and similarly for — nA since H 
and A — A are symmetric. It follows that (A) a H + A — A and we get the result since 
\A-A\^K 2 \A\. □ 

Ruzsa has a further argument published in [DHP04] which improves the K 4 above to 
a K 3 using a slight refinement of the Pliinnecke-Ruzsa inequalities. A refined covering 
argument of Green and Ruzsa [GR06] gives the best known result following from covering 
techniques, while the best known upper bound by any argument is due to Schoen [Schllj 
who showed that the group generated by A has size at most r xl+ ° (1) \A\. 

It may also be worth noting that by letting A be 2K + 1 independent elements the upper 
bound is at least r 2i ^ +1 |y4| so that Schoen's result is tight up to the o(l)-term. (In the 
case when r = 2 this o(l)-term has been eliminated via some arguments from extremal set 
theory introduced by Green and Tao. We shall not pursue this here but see [GT09allKon08j 
and [Zohll] for details.) 
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5. Relative polynomial growth 



In §2] we made it clear that relative polynomial growth was going to be a key concept 
for us, and it arises naturally when we compare the results of §3] with those of §H as we 
shall now see. 

Suppose that \A + A\ ^ A|v4|. By Pliinnecke's inequality we have that 



On the other hand, by an inductive application of Ruzsa's covering lemma (as in the proof 
of Proposition l4~3l) we have that (n - I) A + {A - A) cz (n - 1)T + A - A for all n e N and 
some T of size at most A 4 . Now since G is Abelian we have 



for all neN. For small values of n this is much weaker than Pliinnecke's inequality but 
for large values of n, the estimate from Pliinnecke is exponential while this is polynomial. 

Proposition 14.31 does not adapt directly to the case of general Abelian groups because 
when G does not have bounded exponent we cannot expect the group generated by A to 
be finite (consider, for example, G = Z), but as we saw above it is sufficient to give relative 
polynomial growth. 

Proposition 5.1. Suppose that A cz G has \A+A\ ^ A|t4|. Then A has relative polynomial 
growth of order 0(A 4 ). 

We have an immediate corollary of this in the following. 

Corollary (Corollary 12. 2[) . Suppose that \A + A\ < A|yl|. Then A is l-covered by a 
symmetric neighbourhood of the identity of size at most exp(0(log A))|A| and relative 
polynomial growth of order 0(A 4 ). 

Proof. Since A has relative polynomial growth of order 0(A 4 ) by Proposition 15.11 we see 
by Ruzsa's triangle inequality that \n(A - A)\ < \nA - A\\ - A - nA\/\A\ = n 0( ~ K ^\A\ 
and so A — A also has relative polynomial growth of order 0(A 4 ). On the other hand 
\A — A\ < A 2 |A| and A — A is a symmetric neighbourhood of the identity which 1-covers 
A and so we are done. □ 

The weakness of this result is that it is exponentially expensive to apply: if A has relative 
polynomial growth of order d then A trivially has doubling at most 2 d . This means that if A 
has doubling K and we apply the proposition we get that A has polynomial growth of order 
0(K A ), but then we conclude that A has doubling at most exp(0(A' 4 )) - an exponential 
loss. Incidentally, this exponential loss is exactly the reason for the exponential loss in 
Green and Ruzsa's first version of Freiman's theorem. 

To deal with this situation we have a slight refinement of Ruzsa's covering lemma due to 
Chang |Cha02j . Chang observed that if a set has a sort of relative sub-exponential growth 



\nA\ < K n \A\ for all n e N. 




and so 



\nA\ < \(n - l)T\.\A - A\ < K 2 \(n - l)T\\A\ < n°^\A\ 
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on one scale then the covering set T in Ruzsa'c overing lemma can be made to be highly 
structured and hence get relative polynomial growth of much lower order on all scales. To 
be clear we need some notation: write 

Span(X) := {a.X : = J] a x x : a e {-1, 0, 1} X }. 

xeX 

Then we have the following result. 

Lemma 5.2 (A variant of Chang's covering lemma). Suppose that \kA + S\ < 2 k \S\ (and 
0g g A). Then there is a set T a A with \T\ < k such that A a Span(T) + S — S. 

Proof. Let T be a maximal S -dissociated subset of A, that is a maximal subset of A such 
that 

(a.T + S) n (a'.T + S) = for all a * a' e {0, 1} T . 

Now suppose that x' e A\T and write T' := T u {x'}. By maximality of T there are 
elements a, a 1 e {0, 1} T ' such that {a.T' + S) n {a'.T' + S) # 0. Now if a x , = a' x , then 
{a\r-T + S) n (ct'|t.T + S) ^ contradicting the fact that T is S'-dissociated. Hence, 
without loss of generality, a x i = 1 and a' x , = 0, whence 

x g a'\ T .T - a\ T .T + S-S<^ Span(T) + S - S. 

We are done unless |T| 5 s k; assume it is and let X" c T be a set of size fc. Denote 
{a.T' : a e {0, 1} T '} by P and note that P kA (since G e A), whence 

2 fc |,S| = \P + S\ ^ \kA + S\ < 2 k \S\. 

This contradiction completes the proof. □ 

Dissociativity is a very important concept in harmonic analysis and the relative version 
introduced in the above proof also have many uses. The reader interested in learning more 
is directed to |TV06t §4.5] or the book |Rud90] or Rudin. 

This result yields the following useful corollary. 

Corollary 5.3. Suppose that X c G is a symmetric neighbourhood and \(3k+l)X\ < 2 k \X\ 
for some k e N. Then X has relative polynomial growth of order 0{k). 

Proof. Apply Lemma 15.21 to the sets 3X and X to get a set T of size less than k such that 
3A" c= Span(T) + 2X. It follows that nX c {n - 2) Span(T) + 2X and so 

\nX\ < | (to - 2) Span(T)||2X| < (2to - 3) fe |2X| = 0{n) k \X\ 

provided n ^ 2. We conclude that X has relative polynomial growth 0{k) as required. □ 

This will often be combined with the following useful application of Ruzsa's covering 
lemma. 

Lemma 5.4. Suppose that X is a set of relative polynomial growth of order d and \ A+X\ ^ 
Then A is K -covered by X — X , a symmetric neighbourhood of the identity having 
relative polynomial growth of order 0(d). 
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Proof. We just apply Ruzsa's covering lemma to get that A is A-covered by X — X. This is 
a symmetric neighbourhood of the identity and \n(X — X) | < \X\ by Ruzsa's triangle 
inequality and the fact that X has relative polynomial growth of order d. The result 
follows. □ 

6. BOGOLYUBOV-RUZSA-TYPE LEMMAS 

In the last section we proved Proposition 15. II which converted our small doubling condi- 
tion into a relative polynomial growth of low order condition. As mentioned there this was 
not a particularly efficient process and so we set about proving Corollary 15.31 to do better. 
In this section we shall discuss a general framework for using this corollary. 

To start with suppose that X (is symmetric and) has \X + X\ ^ A|X|. By Pliinnecke's 
inequality we have that 

|(3fc + 1)X\ < K 3k+1 \X\ for aHk>l 

which is not smaller than 2 k (unless A is very small which is a case we have already 
discussed in the introduction). To get a sub-exponential estimate it will be useful to have 
a result of the following shape. 

Proposition 6.1 (Weak Bogolyubov-Ruzsa-type lemma). Suppose that X is symmetric 
with \X + X\ < K\X\ and m e N. Then there is a symmetric neighbourhood of the identity, 
T, such that 

\T\ = n m ,K (\X\) and mT c AX. 

Before remarking on the proof or history, we should see how such a result can be used to 
give a set with relative sub-exponential growth. Given X (symmetric) with |X+A| ^ A|A| 
we apply the lemma with some parameter m to get a set T as described. On the other hand 
by Pliinnecke's inequality with parameter I we have that \AIX\ < A' 4 '|X| and it follows 
that 

\mlT\ < K 4l \X\ = O m , K (K Al \T\) = ex V (0 K (l + O m {l)))\T\. 
At this point put 3k + 1 = ml and letting m —*■ oo very slowly with / we get 

|(3Jfe + l)T| = exp(o K (k))\T\. 

It follows that for k sufficiently large in terms of K the right hand side can be made to be 
at most 2 k and so Corollary 15.31 can be applied to the set T. Whether this turns out to 
be useful or not depends entirely on the quality of the lower bound in Proposition 16.11 and 
establishing results of that type with good bounds will be a major part of the remainder 
of the paper. 

Returning to the history, in the case when X is a thick set (meaning |X| = 
Proposition 16.11 follows from work of Bogolyubov Bog39| . Ruzsa in [Ruz94j introduced 



results of this type to Freiman's theorem, and the above Proposition does follow from his 
work. The difference here is that both Bogolyubov and Ruzsa prove stronger statements, 
in particular showing that the set AA contains a low dimensional Bohr set (see $9] for a 
definition); the set T can then be identified as a 1/m-dilate of this Bohr set. 
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The structurally weaker version of the Bogolyubov-Ruzsa lemma which we need here is 
fortunately rather easier to prove and results in stronger bounds. Since our objective is 
one of bounds this works out well. 

7. The Croot-Sisask lemma 

One of the key recent tools which has made advances in Freiman's theorem possible is 
called the Croot-Sisask lemma. This was first proved by Croot and Sisask in [CS10] and 
then refined by Croot, Laba and Sisask in jCLSllj . The aim of this section is to give a 
proof of the Croot-Sisask lemma and then immediately give an application to Freiman's 
theorem. 

Before starting we shall need a little notation. As we are interested in sumsets it will 
not come as too much surprise that we should be using the convolution of functions. First, 
recall that for p e [1, oo) the space £ P (G) is the space of functions / : G —*■ C endowed with 
the norm 

WflHG) ■■= (Sl/(^)I P 

\xeG 

For infinity we take the usual convention that 

|/|<"(G) := max{|/(x)| : x e G}, 

and apart from £ x there is one other £ p space of particular importance, and that is £ 2 . This 
is also a Hilbert space with inner product defined by 

</,<?> = 2 f{x)gjx) for all f,ge £ 2 (G). 

xeG 

Now, given f,g e £ 1 {G) we define their convolution to be the function / * g determined 
point-wise by 

y+z—x 

Given a finite set A c G we write \ia for the uniform probability mass function supported 
on A. (If G were locally compact rather than discrete then we should define \Ia as a 
measure but we do not need to involve the additional analysis here.) 

There are two ways in which convolution is useful. The first is because it is an average: 
in particular if / e £ l (G) and A a G is finite then / * ha(%) is the average value of / on 
x — A. In general this means that the convolution of two functions is smoother than the 
constituent functions and hence the convolution is easier to analyse. 

Secondly, convolution is useful to us because 

A + B := supp l A * 1 B for all A,BcG, 

so that we can analyse A + B through the (much easier to understand) function 1^ * \ B . 

In a certain sense convolution comes from integrating the regular representation and it 
will be useful to have some notation for this: we write 

p:G^ Aut(£ 2 (G)); x - (/ -> p x (f) : G - C; y ^ f(x + y)). 
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To be concrete, with the regular representation in hand we have that 

/ * g(x) = (f, P-x(g)) for all x e G 

where g(x) = g{—x) for all x e G. 

With this notation we can describe the idea behind the Croot-Sisask lemma. Suppose 
that S is an arithmetic progression and T is a much shorter arithmetic progression with 
the same common difference so that \S + T\ % \S\. 

Now the Croot-Sisask lemma will tell us that for / e £ p (G) the function f * Us does not 
change much when we translate by elements of T. To see this we recall from earlier that 
/ * Hs( x ) is the average value of / on x — S. Then if t e T we have x — S + t ^ x — S so 
that the average of / over x — S + t is approximately the same as the average of / over 
x — S. 

The full Croot-Sisask lemma is the following much stronger version of this argument 
replacing arithmetic progressions by any set with small doubling. 

Lemma 7.1 (Croot-Sisask). Suppose that f e £ P (G) for some p ^ 2, S, T a G are such 
that \S + T\ *5 L\S\, and rj e (0, 1] and p e [2, go) are parameters. Then the set of x such 
that 

\\Px(f * fis) ~ f * VsW&iG) < v\\f\\e?(G) 
is a symmetric neighbourhood of the identity and has size at least {2L)~ ^ ^\T\. 

The proof proceeds by random sampling: the idea is that since / * us is point-wise the 
average value of / on translates of S, this can be well approximated by the average value 
of / on a small set of 'typical' elements of S. We are then done if we let X be the set of 
elements of G such that translating these typical elements does not vary them very much. 
To make the notion of being well approximated precise we shall need an inequality called 
the Marcinkiewicz-Zygmund inequality, and for this we require a little more notation. 

Given p <= [1, oo) and (X, fi) a measure space we write L p ([i) for the space (of equivalence 
classes of) measurable functions on X endowed with the norm 

\\fh H ,)--= (j\f{x)\ p d^{x) 

Theorem 7.2 (Marcinkiewicz-Zygmund inequality). Suppose that p e [2, oo) and we are 

given independent random variables Xx, ■ ■ ■ , X n e L p (F) with X\ = 0. Then 

ii Yj Xi h p (v) = ° ( vp\\ Yj i x *i 2 Hlp 2 /2(p) 

i \ i 

Intuitively one might like to think of the X^s are independent variance one, mean zero 
random variables. Then the central limit theorem suggests that ^fn 1 ^ i X; ~ N(0, 1) 
and the pth moments of the normal distribution are well-known (and in any case easily 
computed); we have 
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Thus the Marcinkiewicz-Zygmund inequality can be thought of as saying that nothing 
much worse than this can happen. 

There is a special case of the Marcinkiewicz-Zygmund inequality called Khintchine's 
inequality which can be used in the proof of the former. 

Theorem 7.3 (Khintchine's inequality). Suppose that p e [2, go) and we are given inde- 
pendent random variables Xx, . . . ,X n e L P (F) with P(Xj = a,) = P(Xj = —a*) = 1/2. 
Then 

i/ 2 n 



I>iu, ( F) = o (^w 2 wc^ =oWi; 



Khintchine's inequality is proved by restricting to the case when p is an even integer (the 
other cases follow by nesting of norms) and then raising the left hand side to the power p, 
multiplying it out and collecting together terms. There are more elegant proofs but this 
gives the main idea. 

Given this, to prove the Marcinkiewicz-Zygmund inequality one can proceed by a process 
of symmetrisation. First, if the variables are complex then the result follows from taking 
real and imaginary parts and so one may as well assume they are real. We then take 
copies Yi, ... ,Y n of Xi, ... , X n such that X { ~ Y { and X 1 , . . . , X n , Yi, . . . , Y n are mutually 
independent. Following this we apply Khintchine's inequality to the variables X i: — Y] 
restricted to atoms of the sample space on which they are symmetric and only take two 
values. Collecting all this together gives the result. 




Proof of Lemma\7J\ Let z±, . . . , Zk be independent uniformly distributed ^-valued random 
variables, and for each y e G define Zi(y) := p- Zi (f)(y) — f*fJ>s(y)- For fixed y, the variables 
Zi(y) are independent and have mean zero, so it follows by the Marcinkiewicz-Zygmund 
inequality and Holder's inequality that 

k / k \P/ 2 

< 0{pY" \ (y i \Z i {y)\ 2 ) dA 

i=l 

< oipfV 2 - 1 ^ I \Hv)\ p di4- 

1=1 J 

Summing over y and interchanging the order of summation we get 

(7-1) 2 12^(0)1^04) < OipfPW^ [22 \Hv)\ p di4- 



\ z i\\ep(G) < \\P-Zi(f)h p (G) + \\f * Vs\\ep(G) < 2||/||£ P ( G ) 
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by the triangle inequality. Dividing (17. ip by k p and inserting the above and the expression 
for the ZjS we get that 



d^) = o(pk- 1 \\f\\% (G) y/ 2 . 



Pick k = 0{r\ 2 p) such that the right hand side is at most (v\\f\\ep(G)/^) p and write C for 
the set of x e S k for which the integrand above is at most (?7||/||^p(g)/2) p ; by averaging 
/4(£ c ) ^ 2-p and so ^ 1 - 2~ p > 1/2. 

Now, A := {(t, . . . , t) : t e T) has £ + A c (5 + T) fc , whence |£ + A| < 2L k \C\ and so 

(1a * 1-A, l-£ * l£)f2(Gfe) = ||l£ * 1a||2(g*) > |^| 2 |^|/2-^ fc , 

by the Cauchy-Schwarz inequality. 

By averaging it follows that at least \ A\ 2 /2L k pairs (z, y) e A x A have l^c*lc(z — y) > 0, 
and hence there are at least \A\/2L k = \T\/2L k distinct elements x e T — T a G with 
l-£ * ^-c{ x i ■ ■ ■ j x ) > 0; write X for this set. 

By design for each x e X there is some z(x) e £ and y(x) e C such that y(x)i = z(x)i + x. 
But then by the triangle inequality we get that 



\p- x {f * Us) -f*Hs\tr{G) < Ir2^-"( x )*^ - / * A*5|<p(G) 

+ Yj -Pz(x)i{f) ~ f * ) \\eP(G)- 

However, since p x is isometric on £ P (G) we see that 

1 k 

\\P-x(f '* Us) ~ f '* HsltP(G) < Wr2jP-v(x)i(f) ~ f * Ps\tP{G) 

i=l 



+ \\\Y A P-z(x) l {f) - f * Vs\\ep{G) < 2(?7||/||^ (G) /2), 

i=l 

since z(x),y(x) e C. □ 



The real strength here is the quality of the bounds for large p. For p = 2 a stronger 
result follows from Chang's theorem (at least in the case of good modelling in the sense 
of Green and Ruzsa |GR07j ) which can actually be used to show that the set on which 
f * Us is approximately invariant is not just large, but it actually contains a large Bohr set. 
The techniques for proving this are Fourier analytic in nature and yield doubly exponential 
dependence on p if they are used to prove a version of the above result. 

In the next section we shall make more careful use of the above result for large p, but 
here we just use the p = 2 case to give a set of polynomial growth following the outline in 
the previous section. 
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Proposition (Proposition [23]). Suppose that \A+A\ < Then A is exp(0(K log Re- 

covered by a symmetric neighbourhood of the identity of size at most exp(0 (log K))\A\ and 
relative polynomial growth of order 0(K log 3 K). 

Proof. We put f = 1a and apply the Croot-Sisask lemma with p = 2, S = T = A, and a 
parameter rj/m (where rj and m are to be optimised later) to get a symmetric neighbourhood 
of the identity, X, with \X\ > (2K)-°( m2 ^\A\ such that 

||Az(1a * Ha) ~ U * Ha\%(g) < r] 2 mT 2 \A\ for all x e X. 

It follows by the triangle inequality that 

\\p x (lA * Ha) ~ 1a * Ha\\%(g) < V 2 \ A \ fo r all x e mX, 

and then multiplying out the ^ 2 -norm we see that 

2||U * Ha\%(g) ~ %(PxO-A * Ha), 1a * Ha)p(g) < V 2 \A\- 
Of course by the Cauchy-Schwarz inequality we have that 

Ua * Ha\% {G ) > * Ha\\1 {g) > \A\/K, 

thus if we set if = 1/K we get 

(p x (l A * Ha), U * Ha)p{g) > \A\/2K. 

It follows that x e 2A — 2A, and so mX a 2A — 2A. Now by Pliinnecke's inequality we 
have that |(3Z + 1)(2A - 2A)| < A' 4 ( 3 ' +1 ) |A| and so 

|(3m/ + 1)X\ ^ |(3/ + l)mX\ ^ (2K)°^ l+m2K) \X\. 

We put I = m 2 K + 0(1) and write k := ml = m 3 K + 0(m) so that 

|(3Jfe + l)X| ^ (2K)° {m2K) \X\ = exp(0(A;m- 1 logA'))|X|. 

We can then pick m = 0(log K) such that the right hand side is strictly less than 2 fc |X| and 
hence |(3& + 1)X\ < 2 k \X\. Thus by Corollary 15.31 we have that X has relative polynomial 
growth of order 0(k) = 0(K log 3 K). 

On the other hand, since X is symmetric we have X — X c 2 A — 2 A and so \X — X\ ^ 
AT 4 |A| by the Pliinnecke-Ruzsa inequalities, but also X + A a 3 A — 2 A. Of course with 
these choices \X\ > exp(-0(K log K))\A\ and hence \X + A\ < exp(0(ATlog K))\X\ by the 
Pliinnecke-Ruzsa inequalities. With this information Lemma 15.41 completes the proof. □ 

This result gives bounds of roughly the same order as those of Green and Ruzsa [GR07] , 
and more or less represents the state of the art prior to Schoen's work |Schllj . 
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8. A WEAK BOGOLYUBOV-RUZSA-TYPE LEMMA WITH STRONG BOUNDS 

This section contains most of the newest material and we shall start with a proof of an 
asymmetric weak Bogolyubov-Ruzsa-type lemma with good bounds in line with the aims 
of SJBJ 

Before diving in it is worth making a few motivating remarks. Our starting point is the 
argument at the end of the last section (the proof of Proposition 12. 3j) . The weakness there 
was that we had to apply the Croot-Sisask lemma with a very small choice of rj. This was 
because we have the lower bound 

II 1 a * Va\\p( G) > \A\/K 

which is small when compared with the trivial upper bound of \A\. We should like some- 
thing somewhat larger, but as it is the lower bound may well be nearly this small. In 
[Schllj Schoen addressed this problem by proving the following important combinatorial 
lemma. 

Lemma 8.1 ( |Schlll Lemma 3]). Suppose that \A + A\ ^ K\A\ and e e (0, 1]. Then there 
are sets X a A — A and Y a A + A such that \X\ ^ exp (-0(2 e log K))\A\ and \Y\ ^ \A\ 
such that 

Uy * Vx\\% iG ) > K~ 2e \Y\. 

The proof of this is a beautiful induction using an observation of Katz and Koester 
|KK10j . which we shall not, unfortunately, have time to pursue here. 

Given this lemma we proceed along the lines of the proof of Proposition 12. 31 but using the 
sets Y and X given by the lemma instead of A and this yields the following proposition. 

Proposition 8.2. Suppose that \A + A\ < A^|y4|. Then A is exp(exp(0(v / log K)))- covered 
by a symmetric neighbourhood of the identity of size at most exp(0 (log K))\A\ and relative 
polynomial growth of order exp(0(-y/log K)) . 

Our approach here is somewhat different and instead of taking the inner product of 
1a * 1a ( or 1y * 1x) with itself we take a different function following Lopez and Ross 
[LR75] : 

(1a * V>a, 1a+a) = \A\. 

Given the above identity we should like to analyse 1a+a*Pa using the Croot-Sisask lemma; 
we do this now in the more convenient case of symmetric sets although the argument is 
not essentially different. 

Proposition 8.3. Suppose that S a G is symmetric and \S + S\ ^ -K'I'S'I; T has \S + T\ ^ 
L\S\, and m e N is a parameter. Then there is a symmetric neighbourhood of the identity, 
X , with 

\X\ > exp(-0(m 2 log K log L))\T\ and mX c 4S. 

Proof. We put / = 15+5 and apply the Croot-Sisask lemma with a parameter rj (to 
be optimised later) to get a symmetric neighbourhood of the identity, X, with |X| ^ 
(2L)-°( ? ^ 2p )|T| such that 

\\Px{1s+s * /As) - 1s+s * f*s\\eP(G) < vUs+s\\ep(G) for all x e X. 
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It follows by the triangle inequality that 

\\Px(ls+s * /As) - l S+s * fJ-sWep(G) < VmUs+sWepiG) for all x e mX. 
Taking an inner product with /xg we see that 

\(PxO-s+s * Us), Us) - 0-s+s * /As,/As>| < r]rn\ls+s\ep{G)\^s\ep'{G) 
where p' is the conjugate exponent to p. Now 

(ls+s * /As, /As) = Q-s+s, Us * /AS> = 1, 
since S is symmetric and supp jjs * Ps ^ S + S. Thus 

l/AS * 15+5 * /As(z) - 1| < ^!|l5+5||^(G)||/^5||^'(G) < V mKl/P - 

We take p = 2 + logK, and then 77 = f2(m _1 ) such that the term on the right is at most 
1/2 to get the desired conclusion. □ 

As a consequence of this we already get the following poly-logarithmic bounds. 

Proposition (Proposition 12.4ft . Suppose that \A + A\ < A"|v4|. Then A is exp(0(log 4 K))- 
covered by a symmetric neighbourhood of the identity of size at most exp(0 (log K))\A\ and 
relative polynomial growth of order 0(log K). 

Proof. We apply the previous result with T = S = A — A and a parameter m e N to be 
optimised later to get a symmetric neighbourhood of the identity, X, with 

\X\ > exp(-0(m 2 \og 2 K))\A~ A\ and mX c A(A - A). 

Given I e N also to be optimised later, by the Pliinnecke's inequality we have that 

\(3ml + l)X\ < \(3l + l)4(A-A)\ ^K°W\A-A\ < K°® exp(0(m 2 log 2 K))\X\. 

We now put I = m 2 log K + 0(1) and write k := ml = m 3 log K + 0(m) so that we have 

|(3fc + l)Jf| < exp(0(m 2 log 2 K))|X| = exp(0 (km' 1 log K))\X\. 

We can then pick m = 0(logK) such that the right hand side is strictly less than 2 fc |X| and 
hence \(3k + 1)X| < 2 fc |X|. Thus by Corollary 15.31 we have that X has relative polynomial 
growth of order 0(k) = 0(log K). 

On the other hand we have X — X a 4A — 4A and so \X — X\ < i^ 8 |A| by the Pliinnecke- 
Ruzsa inequalities, but also I + Ac5i- A A. Hence \X + A\ < exp(0(log 4 K)) \X\ by the 
Pliinnecke-Ruzsa inequalities. With this information Lemma 15.41 completes the proof. □ 

We saw Proposition 18.31 with S = T is already rather powerful, but Konyagin introduced 
a rather nice bootstrapping technique whereby the result is first applied iteratively to 
reduce L to 0(1). To do this we first note the following corollary of Proposition 18.31 

Corollary 8.4. Suppose that S a G is a symmetric neighbourhood of the identity and 
\S + S\ < T is a symmetric neighbourhood of the identity with \S + T\ < L\S\, and 

D > 1 is a parameter. Then there is some symmetric neighbourhood of the identity, T' , 
such that 

\T'\ ^ exp(-0(D 2 logLlogK))\T\, 
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and a symmetric neighbourhood of the identity S' with S c 5' c 55 and \S' + T'\ i> 
K l ' D \S'\. 

Proof. Let k be a natural number to be optimised later and apply Proposition 18.31 to get 
a symmetric neighbourhood of the identity, X", such that 

\T'\ > exp(-0(& 2 logLlog/\"))|T| and AS 3 kT'. 

It follows that \S + kT'\ < |5/S| < i^lS"! by Plunnecke's inequality. Thus by the pigeon-hole 
principle there is some I e {0, . . . ,k — 1} such that 

\(S + IT') + T'\ < if 5 / fc |5 + ir'|. 

Of course we can pick = 0(D) such that K 5 / fc < K X I D and so putting S' := S + IT' the 
corollary is proved. □ 

The pigeon-holing trick was developed by Tao in [TaolOj to establish a Freiman-type 
result in the non-Abelian setting but has since found use in the Abelian setting. 
We are now in a position to apply the above corollary iteratively. 

Proposition 8.5. Suppose that \A + A\ < ^|^4|- Then there is some symmetric neigh- 
bourhood of the identity, T, a natural number r = 0(\og\ogK) 0( ^\ and a symmetric 
neighbourhood of the identity S with /I - i c S c r(i - A) and 

\S + T\= 0{\S\) and \T\ > exp(-0(loglog K")° (1) log 3 K)\S\. 

Proof. We define two sequences of sets (Si)i and (Tj)j, and a sequence of reals (Lj)j such 
that Si and Tj are symmetric neighbourhoods of the identity, and 

A-icSjC 5\A ~ A) and \Si + T { \ < Li\Si\, 

where Lj = exp(4(log 2iT) 2 ). To start with we put Sq := A — A and T := A — A which 
satisfies the requirements by the Pliinnecke-Ruzsa inequalities. At stage i we note that 

\Si + Si\ < \2-5\A-A)\ KK*- &t \A-A\ < K^^S^ 

by the Pliinnecke-Ruzsa inequalities. We apply the previous corollary to the sets Si and 
Tj with parameter Di := 1 + (log(2K 4 ' 51 )) 1-2 <l+1> to get symmetric neighbourhoods of the 
identity S i+ i and T i+ i, with 

\T i+1 \ > exp(-0(A 2 (logL J )(logir 4 - 5I )))|r j | ^exp(-0(exp(0( ? ))log 3 K))|T l |, 

A - A c Si c c 55, c 5 m (A - A) 

and 

\S i+1 + T l+1 \ < exp(4(log2K) 2 " (l+1) )|5 m |. 

We terminate the iteration when 2* +0 W = log 2 log 2 K" and find that the result is proved 
with S = Si and T = Tj. □ 

Finally we have the strongest result of the section and the driving ingredient in this 
survey. 
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Proposition (Proposition [21)]) . Suppose that \A+A\ < K\A\. Then A is exp(0(log 3+o(1) Re- 
covered by a symmetric neighbourhood of the identity of size at most exp(0(log 1+o(1) K))\A\ 
and relative polynomial growth of order 0(log 3+ °^ R). 

Proof. We apply Proposition 18.51 to the set A to get symmetric neighbourhoods of the 
identity S and T, and a natural number r = 0(\og°^ R) such that 

A - A c S c r (A - A), \S + T\ = 0(\S\) and \T\ > exp(-0(log 3+o(1) K))\S\. 

Now, by Proposition 18.31 applied to the sets S and T with a parameter m to be optimised 
later we get a symmetric neighbourhood of the identity X with 

|X| > exp(-0(m 2 log 1+o(1) R))\T\ and mX c 45. 

Given I e N also to be optimised later, by Pliinnecke's inequality we have that 

\(3ml + l)X\ < |(3/ + 1)45| < lf W|5| 

< iT 0(!) exp(0(m 2 log 1+o(1) A') + 0(log 3+o(1) Ji~))|X| 

We now put / = m 2 log°^ K and write k := ml = m 3 log°^ R so that we have 

|(3fc+ 1)X\ ^ exp(0(fc(m _1 log 1+o(1) R + m" 3 log 3+o(1) A))|X|. 

We can then pick m = \og 1+ °^ R such that the right hand side is strictly less than 2 fc |X| 
and hence \(3k + 1)X\ < 2 k \X\. Thus by Corollary 15.31 we have that X has relative 
polynomial growth of order 0(k) = Ofiog^ ^ R). 

On the other hand we have X — X a AS a 4r(A — A) and so, by the Pliinnecke-Ruzsa 
inequalities, we have 

\X - X\ < \4rA - 4rA\ ^ R 8r \A\ < exp(0(log 1+o(1) R))\A\. 

This set inclusion (and the fact that Og e X) also tells us that X + A cz 4r(A — A) + 
A. Hence, by the Pliinnecke-Ruzsa inequalities again, and the fact that \A\ < 151 ^ 
exp(0(log 3+o(1) K))\X\ we have 

\X + A\ < R 8r+1 \A\ = exp(0(log 1+o(1) R))\A\ < exp(0(log 3+o(1) R))\X\ 

With this information Lemma 15.41 completes the proof. □ 

It may be worth saying that all the log *- 1 ^ R terms in the above proposition can be 
replaced by (log log R)°^ terms if desired. 

9. From relative polynomial growth to convex coset progressions 

Our aim in the next few sections it to prove Theorem 12.71 which we restate now for 
convenience. 

Theorem (Theorem 12 .7p . Suppose that X has relative polynomial growth of order d. Then 
there is a centred convex coset progression M such that 

X -X c M,dimM = 0(d\og 2 d) and \M\ < exp(0(dlog 2 d))\X\. 
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We shall make considerable use of harmonic analysis on discrete groups to do this and so 
it will be useful to record some definitions. The classic reference is Rudin |Rud90] although 
the reader will be equally well served by Tao and Vu |TV06] . 

We have already introduced convolution, and the Fourier transform is defined to diago- 
nalise the operators induced by convolution, so we are already have quite a bit of what we 
need. 

Given G (discrete) we write G for the set of homomorphisms 7 : G — > S 1 where S 1 : = 
{z e C : \z\ = 1}. These homomorphisms are called characters and the set G naturally 
supports the structure of a topological group, in particular a compact Abelian group under 
point-wise multiplication of characters, called the dual group of G. 

The dual group is naturally endowed with a translation invariant probability measure 
called the Haar probability measure and we are now in a position to define the Fourier 
transform. Given / e ^(G) we define the Fourier transform of / to be the function 
/ e L°°(G) determined by 

xeG 

This has the property that / * g = f ■ g. More than this we have Plancherel's formula 
which tells us that 

We have already indicated that G has a natural topology, and in fact if G is small enough 
this topology is induced by a metric. There are then a range of metrics which define 
different topologies of G reflecting the subgroup structure of G. These can be defined by 
bases of what are called Bohr sets. 

Given a neighbourhood T of characters on G and a parameter 5 e (0,2] we define the 
Bohr set with frequency set T and width 5 to be the set 

Bohr(r, 6):= {xeG: \-y(x) - 1| < 8 for all 7 e T}. 

One rather useful property of Bohr sets which we use repeatedly is the fact that they are 
balls in a pseudo-metric. What we mean by this is that for a character 7 e G we have the 
very useful triangle inequality 

|1 - j(x + y)\ = |1 - 7 (x) + (1 - 7(z/)h0)| < |1 - l{x)\ + |1 - 70)1 
for all x,y e G. 

The first ingredient in proving Theorem 12.71 is to show that in some sense the topology 
determined by a set X is roughly the same as that determined by certain Bohr sets. 

Proposition 9.1. Suppose that X has relative polynomial growth of order d. Then there 
is a neighbourhood of characters T and a natural number k = 0(d\og 2 d) such that 

X - X c Bohr(r, 1/(4(3* + 1))) and | Bohr(r, 1/2) | < 2 k \X\. 

Now we shall see later that Bohr sets are already convex progressions, and if they satisfy 
a certain growth condition of the form used in Chang's covering lemma then they turn out 
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to be low- dimensional. In particular we have shall show the following which combines with 
the previous result to yield Theorem 12.71 

Proposition 9.2. Suppose that Bohr(T, 5) is a finite Bohr set and k e N is such that 

|Bohr(r,(3fc + l)<y)| < 2 fc |Bohr(r,5)| for some S < l/(4(3fe + 1)). 
Then Bohr(r,<5) is an (at most) k- dimensional centred convex coset progression. 

10. Relative polynomial growth and Bohr sets 

In this section we show how to pass from sets with relative polynomial growth to a Bohr 
set which (effectively) has polynomial growth of relatively low order. Shortly we shall see 
that Bohr sets are convex coset progressions (provided the width parameter is sufficiently 
small), but for now we think of them as a sort of 'approximate annihilator'. 

To find an appropriate Bohr set we shall need to examine the (very) large spectrum of 
a finite set A, which is defined to be the set 

LSpec(A e) := {7 6 G : ||1 - 7lU 2 (^*^) < e}- 

(Note immediately that LSpec(A, e) is a neighbourhood since A is finite.) The definition 
of LSpec we have given takes the form it does for ease of use of the triangle inequality: 
if 7 e LSpec(A, e) and 7' e LSpec(A, e') then 7 + 7' e LSpec(A, e + e') by the triangle 
inequality: 

||1 -1i'\\l*(» a *»_ a ) = ||(1 -7') + (1 ~ lh'h*(n A *n- A ) 

< II 1 - t'IU 2 (/m */*-,o + IK 1 - 7) 7' II l 2 (ma a) 

= ||1 - 7||l 2 (am*m-a) + II 1 - 7'I|l 2 (ma*m-a)- 

On the other hand to connect the definition to the idea that LSpec should represent the 
large spectrum we have the following useful identity: 

111 -711!^.^) =2(1-|/m( 7 )D, 

so that 

II 1 - 7||l 2 (/m*m-a) ^ e if and onl y if I#a(7)I > V 1 - e V 2 - 

This fact will be used extensively in the remainder of the section. 

We have two key tools for establishing our main proposition (Proposition 19. ip . The first 
of these uses an approximation developed by Schoen in |Sch03] and imported into this 
context by Green and Ruzsa in |GR07j . 

Proposition 10.1. Suppose that X has relative polynomial growth of order d. Then 

|Bohr(LSpec(X,e),l/2)| < exp(0(dloge" 1 (i))|X|. 
Proof. By Plancherel's theorem and the Cauchy-Schwarz inequality we have 

l( fc )||2 I v\2k 



(Id) J|lx(7)| 2 ^7 = m* ll, 2(G) ^ |wmni(j)| = tt^ 



(fc)||2 ^ I) L X HH G) 

supp l x 
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We shall show that most of this mass is supported on the set of characters where the 
Fourier transform of 1^ is very large. In particular note that 

f |G(7)| 2 ^7 < {^l^j2\X\f k - 2 [\h 

JLSpec(X,e) c J 

= (l-e 2 /2) k - l \X\ 2k -\ 

by Parseval's theorem. 

Since X has polynomial growth of order d we have that \kX\ < k d \X\ for k ^ 1, so there 
is a positive integer k with k = 0{e^ 2 d\oge^ 1 d) and 

(1 - e 2 l2) k - 1 ^ l/2k d < \X\/2\kX\, 

whence 

\X\ 



2/v 



|lx(7)| 2 ^7< 



'LSpec(X,e) c 

Thus, by ( 110. ip we have 

\X 



2\kX\ 



lx( 1 )\ 2k d 1 > 



2\kX\ 



LSpec(X,e) 

Now, let B be a finite subset of Bohr(LSpec(X, e), 1/2). Integrating we get that |1 — 
/Tb (7) I < 1/2 for any 7 e LSpec(X, e) and it follows by the triangle inequality that |/Tb(t) I ^ 
1/2. Consequently 

f* c* I "v 1 2/ti 

|Q(7)| 2 1^(7)| 2 rf7 > 2- 2 |Q(7)| 2 ^7 > 



'LSpec(X,e) 

On the other hand 



2 3 \kX\' 



|lx(7)r fc |^(7)l^7 < |X| 2fe - 2 ||l x */i B || £ 2 2(G) 

\X\ 2k 2 \\lx * I^B\\e^(G)\\^x * ^b\i^(g) 

by the Hausdorff- Young inequality, Parseval's theorem and then Holder's inequality. Since 
II lx * /Ab||^(g) = \X\ we conclude that 

\X\ \X\ 



2 3 |A;X| 11 A nonc (5, 

This gives the desired upper bound, but on B rather than Bohr(LSpec(X, e), 1/2). The 
result follows since B was an arbitrary finite subset of Bohr(LSpec(X, e), 1/2). □ 

Our second key tool is yet another of the developments of Green and Ruzsa from [GR07] . 
It is only slightly more general than |TV06t Proposition 4.39]. 

Proposition 10.2. Suppose that \X + S\ < K\S\ and e e (0, 1] is a parameter. Then 

X - X c Bohr(LSpec(X + S, e), O(eVK)). 
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Proof. Write 6 = 1- V 1 - e 2 /2 and suppose that 7 e LSpec(X + S,e). Then there is a 
phase u s S 1 such that 

2 1 x+s(x)uj^(x) = ulx+sh) = \lx+s(l)\- 

xeG 

Since the right hand side is real we conclude that 

2 lx+s(ar) Rea; 7 (x) = Re J] lx+s^a^x) = |Q(t)| > (1 - 6)\X + S\. 

xeG xeG 

It follows that 

2 WO)|l - ojj(x)\ 2 = 2 J] l x+ s(x)(l - Reu^x)) ^ 25\X + S\. 

xeG xeG 

Ey ,yieX then 

2 l s (x)|l - tu^ yi h(x)\ 2 ^ 2 l X+ s(x)\l u^x)] 2 ^ 25\X + S\. 

xeG xeG 

The 2-variable Cauchy-Schwarz inequality applied to 1 — u , y(y )j(x) and 1 — u)j(yi)j(x) 
tells us that 

|i -i(yo-yi)\ 2 = l(i - ui(yoh(x)) - (l - ^i{yih(x))\ 2 

< 2(|1 - a; 7 (yo)7(^)| 2 + |1 - ^7(2/1)7^) T) 
for all x e G since |w| = 1 and | 7 (x)| = 1, whence 

|5||1 - 7(2/0 - 2/i)| 2 = 2 l5(x)|l - 7 (yo - 2/i)| 2 < 2 3 5|X + S\. 

xeG 

The result follows since 5 = 0(e 2 ). □ 

With these two results we are in a position to prove the main result of this section. 

Proposition (Proposition 19. II) . Suppose that X has relative polynomial growth of order d. 
Then there is a neighbourhood of characters V and a natural number k = 0(d\og 2 d) such 
that 

X - X c Bohr(r, l/(4(3fc + 1))) and | Bohr(r, 1/2) | < 2 k \X\. 

Proof. Since X has relative polynomial growth of order d we may apply the pigeon-hole 
principle to pick I = O(dlogd) such that \X + IX \ = 0(\lX\). Let e be a parameter to 
be optimised later. By Proposition 110.11 applied to the set (I + 1)X which has relative 
polynomial growth of order 0(d\ogd) we see that for T := LSpec(X + IX, e) (which is 
closed) we have 

I Bohr(r, 1/2)| «S exp(0(d log 2 e _1 d))|X|. 
On the other hand, by Proposition 110.21 applied to the sets X and IX we see that 

Bohr(r,0(e)) ^ X - X. 

We now pick k = f2(e _1 ) such that the width parameter above is at most l/(4(3fc + 1)) 
and the size bound is less than 2 k . This is possible with e = f2(l/(<ilog 2 d)). The result is 
proved. □ 
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11. Ruzsa's EMBEDDING AND CONVEX COSET PROGRESSIONS 

In the paper |Ruz94] Ruzsa developed an important embedding for relating Bohr sets 
and convex coset progressions. Given a set T of characters on 67, write £>(T,R) for the 
vector space of bounded real- valued functions on T. Now, we define the map 

R r :G — B(r,R) 

x h-> R r (x) : r — R;7 — log7(x), 

2m 

where the logarithm takes its principal value. (Since |7(x)| = 1 this means that the 
logarithm lies in (— tti, Tci] and so the functions are bounded.) 

The map preserves inverses provided ||i?r(^)||oo < 1/2, meaning that Rr(—x) = 
—Rr(x); and furthermore we see that if 

||#rOi)ll°o + ■ • • + |#rO<i)lao < V 2 

then 

Rr(xi-\ hid) = Rr(xi) H h Rr(x d ). 

This essentially encodes the idea that Rr behaves like a Freiman morphisnfl, although 
we shall not formalise this notion here. We use this embedding to establish the following 
proposition. 

Proposition (Proposition 19.21) . Suppose that Bohr(r,<5) is a finite Bohr set and d e N is 
such that 

|Bohr(r, (3d + 1)5)\ < 2 d \ Bohr(r, S)\ for some 5 < l/(4(3d+ 1)). 

Then Bohr(r,<5) is an (at most) d-dimensional centred convex coset progression. 

Proof. We shall prove that if L := P|{ker7 : 7 e T} is trivial then Bohr(r,5) is a d- 
dimensional centred convex progression. The result then follows from this by quotienting 
out by L (which does not impact the hypotheses of the proposition) to get a homomorphism 
: 7j d — > G/L and a symmetric convex body Q c IR d such that Bohr(r, S)/L = <p(Q n Z d ). 

Let ei,...,ed be the standard set of generators for Z d and for each i e {1, . . . , d] let 
hi g G be a representative of 0(ej). Since Z d is free define <fi : Z d — > G by extension from 
its value at the generators 0(ej) := /ij and note that 

Bohr (r, <*) = |J Bohr (r, <J)/L = (J 0(Q n Z d ) = 0(Q n Z d ) + L; 

The result follows. 

For notational convenience we write B v := Bohr(r,^) for any rj e (0,2]. To start with 
note that if x e B v then 

II -Rr (x) || 00 < 7^-arccos(l - rf/2) ^ 2n. 

Z7T 

Since 2 (3d + 1)5 < 1/2 we have that if x±, . . . , x^+i G B$ then 
(11.1) Rrixx + ■■■ + x 3d+1 ) = Rr(x!) + ■■■ + R r (x 3d+1 ). 



^We direct the unfamiliar reader to |TV061 Chapter 5.3] 
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By hypothesis we then have that 

\{3d+l)R r {B s )\ = \R r {{3d+l)B s )\ < \(3d+l)B 5 \ < \B (3d+1)5 \ < 2 d \B s \. 

Now 1 5,5 1 = \Rr(Bs)\ since R-p is injective on B&. To see this note that if x,y e i?^ have 
-Rr(x) = Rr(y) then i?r(^ — y) = by flll.ljl and the fact that Rp preserves inverses on 
B$. It then follows that j(x — y) = 1 for all 7 e T, and since L is trivial we conclude that 
x = y. 

In light of all this we have that \3dRr(Bs) + Rr(B$)\ < 2 d \Rr(Bs)\, and so by the variant 
of Chang's covering lemma in Lemma [5.21 applied to the sets 3Rr(B$) and Rr(Bs) (both 
of which are symmetric neighbourhoods since Rr preserves inverses and the identity, and 
B$ is symmetric) we get a set X a 3Rr(Bg) with \X\ < d such that 

3R r (B s ) c Span(X) + 2R r (B s ) c <X> + 2R V (B S ). 

Here, of course, (X) denotes the group generated by X. It follows that for all n e N we 
have 

(n + 2)R r (Bs)^(X) + 2R r (B 5 ). 

Now, for each v e Rr(Bs) and neN there is some v n e 2Rr(B$) such that nv e (X) + v n . 
However, since 2Rr(Bs) is finite it follows that there are distinct natural numbers n ^ m 
such that v n = v m whence 

(n — m)v = nv — mv e ({X} + v n ) — ({X} + v m ) = {X). 

Thus to every v e Rr(Bs) there is some natural number l v such that l v v e (X). Let L be 
the lowest common multiple of all the natural numbers (l v )veR r (B s ) so that Lv e (X) for 
all v e Rr(Bs). It follows that v e (x/L : x e X) and so Rr(Bs) generates a lattice A in 
J3(r,R) of dimension k < \X\ < d. 

Let Vi, . . . ,Vk be a basis for A and for each j e {1, . . . , k} write Vj = JLeB Zj iX Rr(x) for 
some integers (Zj )X ) xe B s - We now put hj := ^ xeBg Zj >x x and define a homomorphism 

4> : Z fe -> G; (ni, . . . , n k ) *-* nifo H h n k h k . 

Finally write V for the subspace of B(T, M) generated by X and ifj : V — > K fc for the change 
of basis taking to the canonical basis vector of M fc , and let Q be the cube in B(T, M.) 
centred at the origin and with side length 25. The set ip(Q n V) is a symmetric convex 
body in M. k and it remains to check that (f)(ip(Q n V) n Z fc ) = B$. 
If Xo e i?^ then i?r(xo) 8 ^ an< ^ -^r(^o) e Q an d so 

Rr(xo) = n\V\ + • • • + n k v k for some n e ip(Q n V) n Z fc . 

Given the definition of the WjS we have that 

k 

Rr(x ) = ^rij 2] Zj, x Rr(x). 

j=l xeB$ 
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Exponentiating this point-wise (via x >— > exp(2-/rix) which is a homomorphism from B(F, R) - 
BiT^ 1 )) tells us that 

k / \™j k 

7(^0) = n n ^^ zj> ) = 7 S ^ s f ° r ai1 ^ e r - 

j = l \xe_Bj / j=l xeB s 

Since L is trivial we conclude that 

k 

X = Yj n j Yj Z 3> xX = Ulhl + • • • + n k h k- 

3=1 xeB s 

It follows that 4>(n) = xq, and so xq e 4>(ip(Q n V) n Z fc ). 

In the other direction suppose that Xo e (p(ijj(Q n V) n Z fc ) and Uo 8 Q n A is such that 
Xq = (j)(ip(v )). Then t> e A and so 

w = niUi + • • • + nfcVfc for some n e Z fc , 

and so 

k 

v o = Yj n i Yj z hxRv{x). 

3=1 xeB s 

We exponentiate point-wise as before to get that 

k / \ n i k 

exp(2niv ) = \\[ \ l( x Y j ' x I = 7(^1 n i Yj ^> x ) for a11 7 e T. 

j=l \xeB s J 3=1 xeB s 

But Vo e Q and so |1 — exp(27ra;o)| < 8 for all 7 e T and hence 

x = 0(n) = Y n J h j = Y n 'J Y z i> xX 8 B& 

3=1 j=l xeB s 

as required. The result is proved. □ 

In light of the start of the proof here it might be more natural to define a centred 
convex coset progression to be a set of the form [J<f)(Q n Z d ) where <p : Z d — > G/iJ is a 
homomorphism, H ^ G and Q is a symmetry convex body in R d . This sort of consideration 
becomes more relevant as one moves to the non-Abelian setting but this is not our concern 
here. 

12. Concluding remarks 

First we should note that Theorem 11.41 follows immediately from combining Proposition 
12.51 and Theorem 12 .7\ and all the log *- 1 -' K terms can be replaced by (log log -ft') ^ terms 
for those interested. 

It may be worth noting that there are really three different functions in Theorem II. 3t 
we really show the following. 

Theorem 12.1. Suppose that A a G has \A + A\ < -K"|^4|- Then A is exp(h(K)) -covered 
by a d(K) -dimensional centred convex coset progression M of size at most exp(f(K))\A\. 
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The quantities h(K), d(K) and f(K) can be traded off between each other to some 
extent but there is an associated cost. The precise relationships are a little ad- hoc because 
they reflect different combinations of our three main examples. Let us recall these now: 

(i) (Cosets of subgroups) Suppose that H is a finite subgroup of G and X is an H- 
separated set of 2K + 0(1) points. Then letting A := X + H we have \A + A\ ~ 
K\A\. 

(ii) ( Convex progressions ) Suppose that M is a (i-dimensional convex coset progression. 
Then we have seen that \M + M\ < exp(0(d))\M\. On the other hand if A is a 
cube in Z d (so that all he side lengths are the same) then in fact + A| ~ 2 d |^4| 
so that the doubling of A really is this large. 

(iii) ( Subsets of subgroups ) Suppose that if is a finite subgroup of G and A is a randomly 
chosen subset of H, taking x e H with probability 1/K. Then with high probability 
L4| ~ \H\/K and \A + A\ ~ \H\ so that \A + A\ ~ K\A\. 

Each of these suggests a lower bound on (respectively) h(K), d(K) and f(K), but they do 
not all give such bounds and there is no one example which forces lower bounds on all of 
them simultaneously. This is because of the previously mentioned ability to trade which 
we shall now explain in a little more depth. We assume that we are given Theorem 112.11 
with some functions h(K),d(K) and f(K). 

12.2. Reducing h(K) in exchange for d(K). One can eliminate h(K) entirely and re- 
place 'exp(/i(X))-covered by' in Theorem 112. II by 'contained in' at the expense of replacing 
d(K) by d(K) + exp(h(K)), and f(K) by 2f(K). This is a little fiddly, but not difficult 
to do. 

Removing the dependence on covering number is the additional requirement which is 
made in traditional statements of Freiman-type theorems; indeed, Green and Ruzsa in 
[GR07] actually proved the following. 

Theorem 12.3 (Green-Ruzsa theorem, original version). Suppose that \A + A\ < 
Then A is contained in a K i+ °^ -dimensional centred convex coset progression M of size 
at most exp(A' 4+ °W)|A|. 

This has been slightly improved, with the power of 4 + o(l) being replaced by 1 + o(l) 
but the reason we do not use this formulation is that the dimension bound must be at least 
Q(K) - exponentially worse than in the Polynomial Freiman-Ruzsa conjecture. This is, of 
course, suggested by the fact that reducing the covering number has a cost of exp(h(K)) 
rather than h{K) associated with it. 

To see the difficulty directly suppose that A is a set of 2K + 0(1) generators of a 
torsion-free group. Then |A + A| ~ if|A|, but any convex coset progression containing A 
has dimension at least 2K — 0(1). 

12.4. Reducing d(K) in exchange for f(K). In general one cannot trade all of the 
dimension in for size, but one can if the group has bounded exponent (meaning every 
element has order bounded by an absolute constant). Then one may reduce d(K) to at 
the expense of replacing f(K) by exp(f(K) + 0(d(K))). In Theorem 11.41 this gives the 
following result. 
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Theorem 12.5. Suppose that G is a group of bounded exponent and A a G has \A + 
A\ < ^|^4|- Then A is exp(0 (log 3+ °^ K))- covered by a subgroup M of size at most 
exp(0(log 3+o(1) K))\A\. 

Conjecturally one can do much better, and here the Polynomial Freiman-Ruzsa conjec- 
ture becomes the following which was one of its (PFR's) original motivations. 

Conjecture 12.6 (Marton's conjecture). Suppose that G is a group of bounded exponent 
and A a G has \A + A\ s$ A'|A|. Then A is exp (O (log K))- covered by a subgroup M of 
size at most exp (O (log A')) | A| . 

12.7. Reducing d(K) in exchange for h(K). We just saw how to trade dimension in for 
size in the case where the group has bounded exponent. In general one cannot trade all of 
the dimension in for size but Green and Tao in [GT06j show (in torsion-free groups) how 
to reduce the dimension of the progression to 0(log A') while incurring an exponential cost 
in the covering number so that h(K) = 0(A). (They get a larger polynomial in K in their 
work but this can be removed given the recent stronger bounds in Freiman's theorem.) 

The paper [GT06j is, in general, rather useful as a source of tools for giving the lower 
bounds on the order of relative polynomial growth of sets and we direct the reader interested 
in the more precise relationships between h(K),d(K) and /(A) there. 

As a final remark it is worth saying that convex progressions may not be quite the right 
notion to deal with and one might like to ask for a convex progression of a particular type. 
There is some discussion of this in |GT06] but we shall not pursue this here, except to 
remark that Freiman's theorem is usually stated using generalised arithmetic progressions 
which are a special type of (translate of a centred) convex progression defined by a cube. 
Specifically a set M is a generalised arithmetic progression if 

M = {x + zxxi H h z d x d : \k\ < Li} 

for some natural numbers L 1; . . . , L d and elements xq, . . . , x d e G. If we define a homo- 
morphism 

<p : 7L d -> G; (zi, ...,z d )^> z x x x H V z d x d , 

and a convex set Q := Ylt=i [~ Li, Li] then M = x$ + </>(Z d n Q). A coset progression 
(as defined by Green and Ruzsa in |GR07j ) is then a set of the form M + H where H < G 
and M is a generalised arithmetic progression in G. Proving the results of this paper for 
coset progressions instead of convex coset progressions is not conceptually harder, but does 
seem to involve some additional technical difficulties. 

Generalised arithmetic progressions have been studied in there own right and there are 
various questions concerning whether they are proper or not, meaning whether <p is injective 
on Q n Z d . Bilu in |Bil99] has a nice discussion of this (see also |TV06t §3.1]). 

13. Applications 

As indicated in the introduction there are numerous applications of Freiman's theorem, 
and for completeness we shall discuss a few of these here. These are mainly chosen because 
they do not require too much additional material to develop rather than because they are 
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necessarily the most exciting. This section is of a much more sketchy nature than the rest 
of the paper: it is intended to indicate directions one can take the results discussed in this 
paper; it is not intended to cover them in detail and the interested reader is referred to the 
papers indicated in each subsection below for more comprehensive discussions. 

One thing it is worth remembering is that while Freiman's theorem is very attractive 
at a qualitative level, in applications one can often squeeze a little more juice out of the 
situation by using the methods of this paper rather than the results. In particular the 
combinatorial arguments on their own are often enough for what one hopes to do. In this 
regard it should be mentioned that there are many direct applications of the techniques of 
Croot and Sisask in |CS10j and |CLSllj . which can also be proved using Freiman's theorem 
but which only really require the Croot-Sisask lemma. 

A second remark is due with regard to Roth's theorem. The reader may be hoping for 
a discussion of bounds in Roth's theorem in this survey, but this is not really the place for 
that. In particular, while the results of Proposition 12.41 are relevant to that work, nothing 
else from the paper is, and a discussion of the combinatorial techniques of Katz and Koester 
|KK10] and the regular Bohr set technology of Bourgain |Bou99] would be required. 

The f/ 3 -inverse theorem. Gowers' work |Gow98] marks the start of an explosion of 
applications of Freiman's theorem after he made the crucial observation that it can be 
combined with the Balog-Szemeredi lemma |BS94j . Gowers used Freiman's theorem to 
improve the bounds in Szemeredi's theorem for arithmetic progressions of length four and 
a little after that Green and Tao expressed Gowers' ideas in a framework often described as 
'quadratic Fourier analysis'. Indeed, Gowers' original aim seems to have included finding 
a proof of Szemeredi's theorem which was closer to Roth's proof of Roth's theorem for 
arithmetic progressions of length three and Green and Tao's framework helps highlight 
these parallels. This subsection is more thoroughly explained in the paper |GT08j . 

Roth's proof of Roth's theorem has, at its core, something now called a ?7 2 -inverse 
theorem. The [7 2 -norm of a function / on a finite (compact) Abelian group G is defined 
by 

\\f\\u 2 {G) = E x,y,zeGf{x)f(x + y)f(x + z)f(x + y + z) . 

It turns out that this is a norm and if A and B are two sets in G with ||1^ — Ib\\u 2 (G) 
small then the number of three-term arithmetic progressions in A is close to that in B. 
This is why the [7 2 -norm is useful for understanding problems about three-term arithmetic 
progressions. It turns out that if a function does not have small f/ 2 -norm then it has a 
linear bias in the following sense. 

Theorem 13.1 (£/ 2 (F2)-inverse theorem). Suppose that f e L X (W%) has \\f\\u 2 (F%) ^ 
^||/||l oo (f?) ■ Then there is a linear polynomial I : Fj — > F 2 , meaning a map x >-> r ■ x 
for some r e such that 

K/> (-1)% 2 (F™)| > £° (1) ||./1z/»(F2)- 

This is essentially trivial to prove and, a version for the group G = Z/iVZ rather than F^, 
can be used as the basis for an iteration to prove Roth's theorem on three-term arithmetic 
progressions. 
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Now suppose that one is interested in four-term arithmetic progressions. In this case if 
we have two sets A and B with ||1^ — ^b\\u 2 (g) small it is not necessarily the case that 
A and B have similar numbers of four-term arithmetic progressions. There is, however, a 
stronger norm called the £/ 3 -norm for which this is true. The ?7 3 -norm of a function / on 
a finite (compact) Abelian group G is defined by 

II/IIlt3 (G ) = E X! y >z>weG (^f(x)f(x + y)f{x + z)f(x + w)- 

x/(x + y + z)f{x + y + w)f{x + z + w)f(x + y + z + u>)^ . 

It turns out that this is also a norm and there is a [7 3 -inverse theorem. This is where 
Theorem 11.41 can be inserted into the various proofs of the inverse theorem. For F2 this is 
due to Samorodnitsky |Sam07| (see also |Wol09j ) for F 2) and one gets the following. 

Theorem 13.2 ([/ 3 (F2)-inverse theorem). Suppose that f e //^(Fg) has \\f\\u 3 (¥ n ) ^ 
^l/IU 00 ^?)- Then there is a quadratic polynomial q : F 2 — > F 2 , meaning a map x 1— > x ■ Ax 
where A is an upper triangular matrix F 2 — » F 2 , sttc/i #icrf 

Kf, (~i) q )mvi)\ > ex P (-o(iog 3+ °( 1 ) r 1 ))!/!^^,. 

This is much harder to prove than the £/ 2 (F2)-inverse theorem and there is actually a 
close relationship between this and Marton's conjecture. Indeed, Green and Tao in [GTlOj 
and Lovett in |LovlOj showed that Marton's conjecture for F 2 is equivalent to the following. 

Conjecture 13.3 (Polynomial £/ 3 (F2)-inverse conjecture). Suppose that f e L 00 ^^) has 
\\f\\u 3 (F2) 5 s ^II/Hl^f™)- Then there is a quadratic polynomial q : F2 — > F 2 such that 

Kf, (-1) V(fj)| > exp(-0(lo g r 1 ))||/|| L » (F , ) . 

If true this would bring the £/ 3 (F2)-inverse state of affairs in line with the U 2 situation. 

Again, the analogue of the [/ 3 (F2)-inverse theorem for the group G = Z/iVZ can be used 
to give a proof of Szemeredi's theorem for progressions of length four, and, of course, there 
are higher analogues called ^7 fc -norms for longer progressions but again we do not discuss 
this here. 

Long arithmetic progressions in sumsets. The question of finding long arithmetic 
progressions in sets of integers is one of central interest in additive combinatorics. The 
basic question has the following form: suppose that Ai,...,Ak a {l,...,iV} all have 
density at least a. How long an arithmetic progression can we guarantee that A\ + • • ■ + A^ 
contains? 

For one set this is addressed by the notoriously difficult Szemeredi's theorem [Sze69l 
ISze75] where the best quantitative work is that of Gowers |Gow98[ IGowOl] (as mentioned 
in the previous subsection); for two sets the longest progression is much longer with the 
state of the art due to Green |Gre02] (see also Croot and Sisask [CS10] ); for three sets 
or more the results get even stronger with the work of Freiman, Halberstam and Ruzsa 
|FHR92j ; and finally for eight sets or more, longer again by the recent work of Schoen 
jSchTT] . 
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The ideas around theorem II .41 (see |SanlOj ) can be used to give an improvement for four 
sets or more, and in particular we have the following theorem. 

Theorem 13.4. Suppose that A±, . . . , A4 a {1, . . . , iV} all have density at least a. Then 
Ai + ■ ■ ■ + A4 contains an arithmetic progression of length N°( l ° s ° (1)2a 1 ) . 

A(4)-estimate for the squares. A wonderful conjecture of Rudin |Rud60j asserts that 
the squares are a A(4)-set. In symbols this is the following conjecture. 

Conjecture 13.5. Suppose that n\, . . . , are natural numbers. Then 



J 



k 

y exp(27rm^i 



i=l 



1 



de = o(k 



2+o(l)> 



Inserting ideas around Theorem 11.41 (see |SanlOj ) into the work of |Cha04j (itself devel- 
oped from an argument of Bourgain in |JL01] ) yield the following result 

Theorem 13.6. Suppose that nx, . . . ,n& are natural numbers. Then 



k 

y exp(27rm^ 



1 



dO = 0(k 3 exp(~n(\og Q{1) 2k))). 



This is essentially equivalent to inserting Theorem II .41 into the proof of [Schlll Theorem 
8] and Gowers' [Gow98j version of the Balog-Szemeredi Lemma |BS94j . Of course, this is 
far form Rudin's conjecture but it is still the best known result at this time. 

The Konyagin-Laba theorem. Ideas around Theorem 11.41 (see |SanlOj ) inserted into 
the argument at the end of |Schll] yield the following quantitative improvement to a result 
from [KL06j . 

Theorem 13.7 (Konyagin-Laba theorem). Suppose that A is a set of reals and a e R is 
transcendental. Then 

\A + a.A\ = exp(fi(log n(1) 2|A|))|A|. 

What is particularly interesting here is that there is a simple construction which shows 
that there are arbitrarily large sets A with \A + a.A\ = exp(0(^/log |A|))|A|. 
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